Improving Name Tagging by Reference Resolution and Relation Detection

نویسندگان

  • Heng Ji
  • Ralph Grishman
چکیده

Information extraction systems incorporate multiple stages of linguistic analysis. Although errors are typically compounded from stage to stage, it is possible to reduce the errors in one stage by harnessing the results of the other stages. We demonstrate this by using the results of coreference analysis and relation extraction to reduce the errors produced by a Chinese name tagger. We use an N-best approach to generate multiple hypotheses and have them re-ranked by subsequent stages of processing. We obtained thereby a reduction of 24% in spurious and incorrect name tags, and a reduction of 14% in missed tags.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Corefrence resolution with deep learning in the Persian Labnguage

Coreference resolution is an advanced issue in natural language processing. Nowadays, due to the extension of social networks, TV channels, news agencies, the Internet, etc. in human life, reading all the contents, analyzing them, and finding a relation between them require time and cost. In the present era, text analysis is performed using various natural language processing techniques, one ...

متن کامل

Rule-based reference resolution for unrestricted text using part-of- speech tagging and noun phrase parsing

This paper describes an experimental syntactic rule-based method for reference resolution in unrestricted texts. References can be resolved automatically and this overcomes a major hurdle in text analysis and provides a key advantage in text `understanding' and information extraction. A shortcoming of systems that locate and extract sentences from unrestricted text to help people assimilate inf...

متن کامل

A New Method for Improving Computational Cost of Open Information Extraction Systems Using Log-Linear Model

Information extraction (IE) is a process of automatically providing a structured representation from an unstructured or semi-structured text. It is a long-standing challenge in natural language processing (NLP) which has been intensified by the increased volume of information and heterogeneity, and non-structured form of it. One of the core information extraction tasks is relation extraction wh...

متن کامل

Definite Description Resolution in Spanish

In this work, a method for the resolution of the identity co-reference produced by definite descriptions in Spanish texts is presented. This method is based on the linguistic knowledge acquired by POS-tagger, synonymous dictionary and relationships between names and verbs resources. This method uses a system of restrictions and preferences in order to obtain the correct antecedent. The method a...

متن کامل

Improvement of Breast Cancer Detection Using Non-subsampled Contourlet Transform and Super-Resolution Technique in Mammographic Images

Introduction Breast cancer is one of the most life-threatening conditions among women. Early detection of this disease is the only way to reduce the associated mortality rate. Mammography is a standard method for the early detection of breast cancer. Today, considering the importance of breast cancer detection, computer-aided detection techniques have been employed to increase the quality of ma...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005